'tyflSSr SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Caras, Ingrid W 
(ii) TITLE OF INVENTION: A2-1 Neurotrophic Factor 
5 (iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

<B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

10 (D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 
15 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
<D) SOFTWARE: WinPatin (Genentech) 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/635130 
20 (B) FILING DATE: 19-Mar-1996 

(C) CLASSIFICATION: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Torchia, PhD., Timothy E. 

(B) REGISTRATION NUMBER: 36,700 

25 (C) REFERENCE/DOCKET NUMBER: P1001 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/225-8674 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 



30 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1877 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 



(ix) FEATURE: 

(A) NAME/KEY: Extra Cellular Domain 

(B) LOCATION: 244-899 

(C) IDENTIFICATION METHOD: 
40 (D) OTHER INFORMATION: 



1 



(ix) FEATURE: 

(A) NAME /KEY : Transmembrane Domain 

(B) LOCATION: 901-978 

(C) IDENTIFICATION METHOD: 
5 (D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: signal peptide 

(B) LOCATION: 244-321 

<C) IDENTIFICATION METHOD: 
10 (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC GACGGCCCCT 50 

GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC GCGGGCACAG 100 

CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC CGGGGTGGTC 150 

15 CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCCGGGG GGTGGGCGAC 200 

TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTTGGCGGG GTC ATG 246 

Met 
1 

GGG CCC CCC CAT TCT GGG CCG GGG GGC GTG CGA GTC GGG 285 
20 Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly 

5 10 

GCC CTG CTG CTG CTG GGG GTT TTG GGG CTG GTG TCT GGG 324 
Ala Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly 
15 20 25 

25 CTC AGC CTG GAG CCT GTC TAC TGG AAC TCG GCG AAT AAG 363 
Leu Ser Leu Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys 
30 35 40 

AGG TTC CAG GCA GAG GGT GGT TAT GTG CTG TAC CCT CAG 402 
Arg Phe Gin Ala Glu Gly Gly Tyr Val Leu Tyr Pro Gin 
30 45 50 

ATC GGG GAC CGG CTA GAC CTG CTC TGC CCC CGG GCC CGG 441 
lie Gly Asp Arg Leu Asp Leu Leu Cys Pro Arg Ala Arg 
55 60 65 

CCT CCT GGC CCT CAC TCC TCT CCT AAT TAT GAG TTC TAC 480 
35 Pro Pro Gly Pro His Ser Ser Pro Asn Tyr Glu Phe Tyr 
70 75 

AAG CTG TAC CTG GTA GGG GGT GCT CAG GGC CGG CGC TGT 519 
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+ 



Lys Leu 
80 

GAG GCA 
Glu Ala 

5 

CGC CCA 
Arg Pro 



GAG TAT 
10 Glu Tyr 
120 

CAC CAC 
His His 



15 CGG GAG 
Arg Glu 
145 

ACC AGA 
Thr Arg 

20 

CCC CGA 
Pro Arg 



ATG CCC 
25 Met Pro 
185 

GAG CCT 
Glu Pro 



30 AAT GCA 
Asn Ala 
210 

CCC AGC 
Pro Ser 

35 

GCG CTG 
Ala Leu 



ATG TGT 



Tyr Leu Val 



CCC CCT GCC 
Pro Pro Ala 
95 

GAC CTG GAT 
Asp Leu Asp 
110 

AGC CCT AAT 
Ser Pro Asn 



GAT TAC TAC 
Asp Tyr Tyr 
135 

GGC CTG GAG 
Gly Leu Glu 



GGC ATG AAG 
Gly Met Lys 
160 

GGA GGG GCT 
Gly Gly Ala 
175 

ATG GAA AGA 
Met Glu Arg 



GGG AAG GAG 
Gly Lys Glu 
200 

ACC TCC CGG 
Thr Ser Arg 



ATG CCT GCA 
Met Pro Ala 
225 

CTC TTG CTG 
Leu Leu Leu 
240 

TGG CGG AGA 



Gly Gly Ala 
85 

CCA AAC CTC 
Pro Asn Leu 
100 

CTC CGC TTC 
Leu Arg Phe 

CTC TGG GGC 
Leu Trp Gly 
125 

ATC ATT GCC 
He He Ala 



AGC CTG CAG 
Ser Leu Gin 
150 

GTG CTT CTC 
Val Leu Leu 
165 

GTC CCC CGA 
Val Pro Arg 



GAC CGA GGG 
Asp Arg Gly 
190 

AAC CTG CCA 
Asn Leu Pro 



GGT GCT GAA 
Gly Ala Glu 
215 

GTG GCT GGG 
Val Ala Gly 
230 

GGC GTG GCA 
Gly Val Ala 



CGG CGG GCC 



Gin Gly Arg 
90 

CTT CTC ACT 
Leu Leu Thr 



ACC ATC AAG 
Thr He Lys 
115 

CAC GAG TTC 
His Glu Phe 



ACA TCG GAT 
Thr Ser Asp 
140 

GGA GGT GTG 
Gly Gly Val 
155 

CGA GTG GGA 
Arg Val Gly 



AAA CCT GTG 
Lys Pro Val 
180 

GCA GCC CAC 
Ala Ala His 



GGT GAC CCC 
Gly Asp Pro 
205 

GGC CCC CTG 
Gly Pro Leu 
220 

GCA GCA GGG 
Ala Ala Gly 



GGG GCT GGG 
Gly Ala Gly 
245 

AAG CCT TCG 



Arg Cys 

TGT GAT 558 
Cys Asp 
105 

TTC CAG 597 
Phe Gin 



CGC TCG 63 6 
Arg Ser 
130 

GGG ACC 67 5 
Gly Thr 

TGC CTA 714 
Cys Leu 



CAA AGT 753 
Gin Ser 
170 

TCT GAA 792 
Ser Glu 



AGC CTG 831 
Ser Leu 
195 

ACC AGC 870 
Thr Ser 



CCC CCT 909 
Pro Pro 



GGG CTG 948 
Gly Leu 
235 

GGT GCC 987 
Gly Ala 



GAG AGT 1026 



3 



Met Cys 
250 

CGC CAC 
Arg His 

5 

CTG GGC 
Leu Gly 
275 

GCT GAG 
10 Ala Glu 



GGG GCT 
Gly Ala 



15 GTG AGT 
Val Ser 
315 

GAT GGG 
Asp Gly 

20 

TCG ATT 
Ser lie 
340 

CAA CTG 
25 Gin Leu 



ACA TTC 
Thr Phe 



3 0 TGC CGG 
Cys Arg 
380 

TCC ATG 
Ser Met 

35 

ATC AGA 
lie Arg 
405 

GCT CTG 



Trp Arg Arg 



CCT GGT CCT 
Pro Gly Pro 
265 

CTG GGG GGT 
Leu Gly Gly 



CCT GGG GAG 
Pro Gly Glu 
290 

GCA GAT CCC 
Ala Asp Pro 
305 

GGT GAC TAT 
Gly Asp Tyr 



CCC CCC CAG 
Pro Pro Gin 
330 

TCT GTG TTG 
Ser Val Leu 



TTT TTC ATG 
Phe Phe Met 
355 

TTA TTT CCT 
Leu Phe Pro 
370 

ATG ACT TCA 
Met Thr Ser 



CAG GCC TGC 
Gin Ala Cys 
395 

TGG TGT TTC 
Trp Cys Phe 



TTT GTG CTT 



Arg Arg Ala 
255 

GGC TCC TTC 
Gly Ser Phe 



GGA GGT GGG 
Gly Gly Gly 
280 

CTA GGG ATA 
Leu Gly lie 
295 

CCC TTC TGC 
Pro Phe Cys 



GGG CAT CCT 
Gly His Pro 
320 

AGC CCT CCA 
Ser Pro Pro 



GAG TGG CCC 
Glu Trp Pro 
345 

CGA TCC AAG 
Arg Ser Lys 
360 

GTG CAA GTT 
Val Gin Val 



TTT AGC TTT 
Phe Ser Phe 
385 

AGA GCA CAG 
Arg Ala Gin 



TGG GGG GAC 
Trp Gly Asp 
410 

GTG CTT ATT 



Lys Pro Ser 



GGG AGG GGA 
Gly Arg Gly 
270 

ATG GGA CCT 
Met Gly Pro 
285 

GCT CTG CGG 
Ala Leu Arg 



CCC CAC TAT 
Pro His Tyr 
310 

GTG TAT ATC 
Val Tyr He 



AAC ATC TAC 
Asn He Tyr 
335 

ATA TTG CAT 
He Leu His 
350 

TGC TCC CGT 
Cys Ser Arg 



ATT ACG ACA 
He Thr Thr 
375 

ACC ACC CTG 
Thr Thr Leu 



ATG GGG GAA 
Met Gly Glu 
400 

AGG ATC CTG 
Arg He Leu 
415 

CTT CTT CTT 



Glu Ser 
260 

GGG TCT 1065 
Gly Ser 



CGG GAG 1104 
Arg Glu 



GGT GGC 1143 
Gly Gly 
300 

GAG AAG 1182 
Glu Lys 



GTG CAG 1221 
Val Gin 
325 

TAC ACA 12 60 
Tyr Thr 



ACG ATA 12 99 
Thr He 



GTC ACT 1338 
Val Thr 
365 

TCG ACT 1377 
Ser Thr 



AAC CCA 1416 
Asn Pro 
390 

TTC CGA 1455 
Phe Arg 



GGT ACG 1494 
Gly Thr 



GGG AGG 1533 
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Ala Leu Phe Val Leu Val Leu lie Leu Leu Leu Gly Arg 
420 425 430 

CTG AAT ATG CAT CAG ACG ACA CTG CTC CGG CAA CGG GCC 1572 
Leu Asn Met His Gin Thr Thr Leu Leu Arg Gin Arg Ala 
5 435 440 

AGT GTG GAG GCG GAA GCC GGC CAG CAT GGT CCC CTG TG 1610 
Ser Val Glu Ala Glu Ala Gly Gin His Gly Pro Leu 
445 450 455 

ATAGGATTGA AAGAGCTACT GAGAATAGGG GGCTTCTCAA TGAGAGAGCG 1660 

10 GAGGCTGCTG TTATCATGGG AACCAGGCAG ATCAATCATC CCTGGCAGGT 1710 

CAGGCAGGAA GTTACTTAGC TTCTCCTTCA CCTTCTTCCC ACAGAATTTA 1760 

TTATAGGCTT GTTCCAAGTT GTAGTGTGTG ATCAGATTCG TGCTGCCTGT 1810 

CAGCTCTGTG CTACCTGGCA GTTCCCCTCA TGGAATTCGA TATCAAGCTT 1860 

ATCGATACCG TCGACCT 1877 

15 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly Ala 
15 10 15 

Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly Leu Ser Leu 
20 25 30 

25 Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys Arg Phe Gin Ala Glu 

35 40 45 

Gly Gly Tyr Val Leu Tyr Pro Gin lie Gly Asp Arg Leu Asp Leu 
50 55 60 

Leu Cys Pro Arg Ala Arg Pro Pro Gly Pro His Ser Ser Pro Asn 
30 65 70 75 

Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg 
80 85 90 

Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 
95 100 105 
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Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 
110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 
125 130 135 

5 Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leu 
155 160 165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg Lys Pro 
10 170 175 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser 
185 190 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser Asn 
200 205 210 

15 Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 220 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 
230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 
20 245 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 
260 265 270 

Arg Gly Gly Ser Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro 
275 280 285 

25 Arg Glu Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 295 300 

Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 
305 310 315 

Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 
30 320 325 330 

Gin Ser Pro Pro Asn lie Tyr Tyr Thr Ser lie Ser Val Leu Glu 
335 340 345 

Trp Pro lie Leu His Thr lie Gin Leu Phe Phe Met Arg Ser Lys 
350 355 360 

3 5 Cys Ser Arg Val Thr Thr Phe Leu Phe Pro Val Gin Val He Thr 
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365 370 375 

Thr Ser Thr Cys Arg Met Thr Ser Phe Ser Phe Thr Thr Leu Asn 
380 385 390 

Pro Ser Met Gin Ala Cys Arg Ala Gin Met Gly Glu Phe Arg lie 
5 395 400 405 

Arg Trp Cys Phe Trp Gly Asp Arg lie Leu Gly Thr Ala Leu Phe 
410 415 420 

Val Leu Val Leu lie Leu Leu Leu Gly Arg Leu Asn Met His Gin 
425 430 435 

10 Thr Thr Leu Leu Arg Gin Arg Ala Ser Val Glu Ala Glu Ala Gly 

440 445 450 

Gin His Gly Pro Leu 
455 

(2) INFORMATION FOR SEQ ID NO : 3 : 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 80 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC GACGGCCCCT 50 

GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC GCGGGCACAG 100 

CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC CGGGGTGGTC 150 

CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCCGGGG GGTGGGCGAC 200 

2 5 TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTTGGCGGG GTCATGGGGC 250 

CCCCCCATTC TGGGCCGGGG GGCGTGCGAG TCGGGGCCCT GCTGCTGCTG 3 00 

GGGGTTTTGG GGCTGGTGTC TGGGCTCAGC CTGGAGCCTG TCTACTGGAA 3 50 

CTCGGCGAAT AAGAGGTTCC AGGCAGAGGG TGGTTATGTG CTGTACCCTC 400 

AGATCGGGGA CCGGCTAGAC CTGCTCTGCC CCCGGGCCCG GCCTCCTGGC 450 

30 CCTCACTCCT CTCCTAATTA TGAGTTCTAC AAGCTGTACC TGGTAGGGGG 500 

TGCTCAGGGC CGGCGCTGTG AGGCACCCCC TGCCCCAAAC CTCCTTCTCA 550 
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CTTGTGATCG CCCAGACCTG GATCTCCGCT TCACCATCAA GTTCCAGGAG 600 
TATAGCCCTA ATCTCTGGGG CCACGAGTTC CGCTCGCACC ACGATTACTA 650 
CATCATTGCC ACATCGGATG GGACCCGGGA GGGCCTGGAG AGCCTGCAGG 700 
GAGGTGTGTG CCTAACCAGA GGCATGAAGG TGCTTCTCCG AGTGGGACAA 750 
5 AGTCCCCGAG GAGGGGCTGT CCCCCGAAAA CCTGTGTCTG AAATGCCCAT 800 
GGAAAGAGAC CGAGGGGCAG CCCACAGCCT GGAGCCTGGG AAGGAGAACC 850 
TGCCAGGTGA CCCCACCAGC AATGCAACCT CCCGGGGTGC TGAAGGCCCC 900 
CTGCCCCCTC CCAGCATGCC TGCAGTGGCT GGGGCAGCAG GGGGGCTGGC 950 
GCTGCTCTTG CTGGGCGTGG CAGGGGCTGG GGGTGCCATG TGTTGGCGGA 1000 

10 GACGGCGGGC CAAGCCTTCG GAGAGTCGCC ACCCTGGTCC TGGCTCCTTC 1050 
GGGAGGGGAG GGTCTCTGGG CCTGGGGGGT GGAGGTGGGA TGGGACCTCG 1100 
GGAGGCTGAG CCTGGGGAGC TAGGGATAGC TCTGCGGGGT GGCGGGGCTG 1150 
CAGATCCCCC CTTCTGCCCC CACTATGAGA AGGTGAGTGG TGACTATGGG 1200 
CATCCTGTGT ATATCGTGCA GGATGGGCCC CCCCAGAGCC CTCCAAACAT 12 50 

15 CTACTACAAG GTATGAGGGC TCCTCTCACG TGGCTATCCT GAATCCAGCC 1300 
CTTCTTGGGG TGCTCCTCCA GTTTAATTCC TGGTTTGAGG GACACCTCTA 1350 
ACATCTCGGC CCCCTGTGCC CCCCCAGCCC CTTCACTCCT CCCGGCTGCT 1400 
GTCCTCGTCT CCACTTTTAG GATTCCTTAG GATTCCCACT GCCCCACTTC 1450 
CTGCCCTCCC GTTTGGCCAT GGGTGCCCCC CTCTGTCTCA GTGTCCCTGG 1500 

20 ATCCTTTTTC CTTGGGGAGG GGCACAGGCT CAGCCTCCTC TCTGACCATG 1550 
ACCCAGGCAT CCTTGTCCCC CTCACCCACC CAGAGCTAGG GGCGGGAACA 1600 
GCCCACCTTT TGGTTGGCAC CGCCTTCTTT CTGCCTCTCA CTGGTTTTCT 1650 
CTTCTCTATC TCTTATTCTT TCCCTCTCTT CCGTCTCTAG GTCTGTTCTT 1700 
CTTCCCTAGC ATCCTCCTCC CCACATCTCC TTTCACCCTC TTGGCTTCTT 17 50 

2 5 ATCCTGTGCC TCTCCCATCT CCTGGGTGGG GGCATCAAAG CATTTCTCCC 1800 
CTTAGCTTTC AGCCCCCCTT CTGACCTCTC ATACCAACCA CTCCCCTCAG 1850 
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TCTGCCAAAA ATGGGGGCCT TATGGGGAAG GCTCTGACAC TCCACCCCAG 1900 

CTCAGGCCAT GGGCAGCAGG GCTCCATTCT CTGGCCTGGC CCAGGCCTCT 1950 

ACATACTTAC TCCAGCCATT TGGGGTGGTT GGGTCATGAC AGCTACCATG 2000 

AGAAGAAGTG TCCCGTTTTG TCCAGTGGCC AATAGCAAGA TATGAACCGG 2050 

5 TCGGGACATG TATGGACTTG GTCTGATGCT GAATGGGCCA CTTGGGACCG 2100 

GAAGTGACTT GCTCCAGACA AGAGGTGACC AGGCCCGGAC AGAAATGGCC 2150 

TGGGAAGTAG CAGAAGCAGT GCAGCAGGAA CTGGAAGTGC CTTCATCCAG 2200 

GACAGGAAGT AGCACTTCTG AAACAGGAAG TGGTCTGGCT GGAACTCCAA 2250 

GTGGCTTAGT CTGGGGGATC AGGAGGTGGG AGGTGGATGG TTCTTATTCT 23 00 

10 GTGGAGAAGA AGGGCGGGAA GAACTTCCTT TCAGGAGGAA GCTGGAACTT 2350 

ACTGACTGTA AGAGGTTAGA GGTGGACCGA 23 80 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 
15 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Gly Pro Pro His Ser Gly Pro Gly Gly Val Arg Val Gly Ala 
15 10 15 

2 0 Leu Leu Leu Leu Gly Val Leu Gly Leu Val Ser Gly Leu Ser Leu 

20 25 30 

Glu Pro Val Tyr Trp Asn Ser Ala Asn Lys Arg Phe Gin Ala Glu 
35 40 45 

Gly Gly Tyr Val Leu Tyr Pro Gin lie Gly Asp Arg Leu Asp Leu 
25 50 55 60 

Leu Cys Pro Arg Ala Arg Pro Pro Gly Pro His Ser Ser Pro Asn 
65 70 75 

Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly Ala Gin Gly Arg 
80 85 90 

30 Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu Leu Thr Cys Asp 

95 100 105 
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Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 
110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 
125 130 135 

5 Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leu 
155 160 165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala- Val Pro Arg Lys Pro 
10 170 175 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala His Ser 
185 190 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Pro Thr Ser Asn 
200 205 210 

15 Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 220 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 
230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 
20 245 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 
260 265 270 

Arg Gly Gly Ser Leu Gly Leu Gly Gly Gly Gly Gly Met Gly Pro 
275 280 285 

25 Arg Glu Ala Glu Pro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 295 300 

Gly Ala Ala Asp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 
305 310 315 

Gly Asp Tyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 
30 320 325 330 

Gin Ser Pro Pro Asn lie Tyr Tyr Lys Val 
335 340 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 454 base pairs 
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(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

5 GCCGACGCTG TGAGGCACCC CCTGCCCCAA ACCTCCTTCT CACTTGTGAT 50 
CGCCCAGACC TGGATCTCCG CTTCACCATC AAGTTCCAGG AGTATAGCCC 100 
TAATCTCTGG GGCCACGAGT TCCGCTCGCA CCACGATTAC TACATCATTG 150 
CCACATCGGA TGGGACCCGG GAGGCCTGGG AGAGCCTGCA GGGAAGTGTG 200 
TGCCTAACCA GAGGCATGAA GGTGCTTCTC CGAGTNGGAC AAAGTCCCGA 250 
10 GGAGGGGCTG TCCCCCGAAA ACCTGTGTCT GAAATGCCCA TGGAAAGAGA 300 
CCGAGGGGCA GCCCACAGCC TGGGAGCCTG GGGAAGGAGA ACCTGCCAGG 350 
TGACCCCACC AGCAATNCAA CCTTCCGGGG TTGCTTGAAG GGCCCCTTGA 400 
CCCTTTCCCA GCATTGCNTG CANTTGGTTN GGGGCAGCAN GGGGGNGTTT 450 
TGGC 454 

15 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Asp Lys Thr His Thr Cys Pro Pro Cys Pro 
15 10 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 60 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
30 GGACAAAGTC CCGAGGAGGG GCTGTCCCCC GAAAACCTGT GTCTGAAATG 50 
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CCCATGGAAA 60 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 base pairs 
5 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



CAGGTTCTCC TTCCCCAGGC TCCCAGGCTG TGGGCTGCCC CTCGGTCTCT 50 
10 TTCCATGGGC 60 
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SEQUENCE LISTING 



NERAL INFORMATION: 
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(i) APPLICANT: Caras, Ingrid W 

(ii) TITLE OF INVENTION: A2-1 Neurotrophic Factor 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

' (B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy 
15 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 



disk 



20 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/635130/ 

(B) FILING DATE: 19-Mar-1996 

( C ) CLASSIFICATION : 



25 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Torchia, PhD., TiAiothy E. 

(B) REGISTRATION NUMBER: ^6,700 

(C) REFERENCE /DOCKET NUMBER: P1001 



(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: 415/225-8674 

(B) TELEFAX: 415 / 952 / -9 881 

(C) TELEX: 910/371/7168 

3 0 (2) INFORMATION FOR SEO/lD NO : 1 : 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3:877 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ix) FEATURE 

(A) NAME /KEY : Extra Cellular Domain 
(BK/LOCATION: 244-899 
(OJ IDENTIFICATION METHOD: 
)) OTHER INFORMATION: 



67 
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(ix) FEATURE: 

(A) NAME /KEY: Transmembrane Domain 

(B) LOCATION: 901-978 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: signal peptide 
(BJ LOCATION: 244-321 

(C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



GNTCTAGAAN TAGTGGATCC CCCCGGGCTG CAGGAATTCC 
GGAAGGGCTC TGGTGGGGCT GAGCGCTCTG CCGCGGGGGC 
CAGGAAGCAG GTCCGCGTGG GCGCTGGGGG CATCAGCTAC 
15 CGGGCTGAAG AGCCAGGCAG CCAAGGCAGC CACCCOZGGG 
TTTGGGGGAG TTGGTGCCCC GCCCCCCAGG CCTJTOGCGGG 



GACGGOCCCT 50 

fGGCACAG 100 

CGGGGTGGTC 150 

GGTGGGCGAC 200 

GTC ATG 246 
Met 
1 



GGG CCC 
20 Gly Pro 



GCC CTG 
Ala Leu 
15 



CCC CAT 
Pro His 
5 

CTG CTG 
Leu Leu 



TCT GGG CCG 
Ser Gly Pro 



CTG GGG/GTT 
Leu GW Val 
^0 



GGG GGC GTG CGA 
ly Gly Val Arg 
10 

TTG GGG CTG GTG 
Leu Gly Leu Val 
25 



GTC GGG 285 
Val Gly 




25 CTC AGC 
Leu Ser 



CTG GAG 
Leu Glu 
30 



CJZT GTC TAC 
ro Val Tyr 



TGG AAC TCG GCG 
Trp Asn Ser Ala 
35 



AAT AAG 3 63 
Asn Lys 
40 



30 



AGG TTC 
Arg Phe 



CAG/GCA 
Gl/ Ala 



GAG GGT GGT 
Glu Gly Gly 
45 



TAT GTG CTG TAC 
Tyr Val Leu Tyr 
50 



CCT CAG 402 
Pro Gin 



ATC 
He 



GGC 



GAC CGG 
Asp Arg 



'55 



CTA GAC CTG 
Leu Asp Leu 
60 



CTC TGC CCC CGG 
Leu Cys Pro Arg 



GCC CGG 441 
Ala Arg 
65 



COT CCT 
3 5 Dro Pro 



GGC CCT 
Gly Pro 
70 



CAC TCC TCT 
His Ser Ser 



CCT AAT TAT GAG 
Pro Asn Tyr Glu 
75 



TTC TAC 480 
Phe Tyr 



AAG CTG TAC CTG GTA GGG GGT GCT CAG GGC CGG CGC TGT 519 



68 





Lys Leu Tyr 
80 



Leu Val Gly Gly 
85 



Ala Gin Gly Arg Arg Cys 
90 



GAG GCA CCC 
Glu Ala Pro 
5 95 



CCT GCC CCA AAC 
Pro Ala Pro Asn 



CTC CTT CTC ACT 
Leu Leu Leu Thr 
100 



TGT GAT 558 
Cys Asp 
105 



CGC CCA GAC 
Arg Pro Asp 



CTG GAT CTC CGC 
Leu Asp Leu Arg 
110 



TTC ACC ATC AAG 
Phe Thr lie Lys 
115 



TTC CAG 597 
Phe Gin 



GAG TAT AGC CCT AAT CTC TGG 
10 Glu Tyr Ser Pro Asn Leu Trp 
120 125 



GGC CAC GAG TTC 
Gly His Glu Phe 



CGC TCG 63 < 
Arg Ser 
130 



CAC CAC GAT TAC TAC ATC ATT GCC ACA TCG GAT GGG 
His His Asp Tyr Tyr lie lie Ala Thr Ser Asp Gly, 
135 140 



675 



15 CGG GAG GGC CTG GAG AGC CTG 
Arg Glu Gly Leu Glu Ser Leu 
145 150 



CAG GGA GGT GTG/ 
Gin Gly Gly Vc 
55 



TGC CTA 714 
Cys Leu 



ACC AGA GGC ATG AAG GTG CTT 
Thr Arg Gly Met Lys Val Leu 
20 160 



CTC CGA GTG GGA 
Leu Arg/val Gly 
165 



CAA AGT 753 
Gin Ser 
170 



CCC CGA GGA 
Pro Arg Gly 



GGG GCT GTC CCC 
Gly Ala Val Pro 
175 



CGA/ AAA CCT GTG 
Aj/g Lys Pro Val 
180 



TCT GAA 792 
Ser Glu 



ATG CCC ATG 
2 5 Met Pro Met 
185 



GAA AGA GAC CiSA 
Glu Arg Asp yArg 
190 



GGG GCA GCC CAC 
Gly Ala Ala His 



AGC CTG 831 
Ser Leu 
195 



GAG CCT GGG 
Glu Pro Gly 



3 0 AAT GCA ACC 
Asn Ala Thr 
210 



AAG GAG AAC CTG 
Lys Glu/ Asn Leu 
200 

TCC/CGG GGT GCT 
Se>r Arg Gly Ala 
215 



CCA GGT GAC CCC 
Pro Gly Asp Pro 
205 

GAA GGC 1CCC 
Glu Gly VPro 



1 ^ 

CTG 

Leu 
220 



ACC AGC 870 
Thr Ser 



1* 6 k ^ * ^ 
CCC CCT 909 



Pro Pro 



CCC AGC 
Pro Ser Me 
35 225 



CCT GCA GTG GCT 
Pro Ala Val Ala 



GGG GCA GCA GGG 
Gly Ala Ala Gly 
230 



GGG CTG 948 
Gly Leu 
235 



GCG CT(S CTC 
Ala Leu Leu 



TTG CTG GGC GTG 
Leu Leu Gly Val 
240 



GCA GGG GCT GGG 
Ala Gly Ala Gly 
245 



GGT GCC 987 
Gly Ala 



ATS TGT TGG CGG AGA CGG CGG GCC AAG CCT TCG GAG AGT 1026 



69 



Met 



Cys Trp 
250 



Arg Arg Arg 



Arg Ala Lys Pro 
255 



Ser Glu Ser 
260 



CGC 
Arg 



CAC CCT 
His Pro 



GGT CCT GGC 
Gly Pro Gly 
265 



TCC TTC GGG AGG 
Ser Phe Gly Arg 
270 



GGA GGG TCT 1065 
Gly Gly Ser 



CTG 
Leu 
275 



GGC CTG 
Gly Leu 



GGG GGT GGA 
Gly Gly Gly 
280 



GGT GGG ATG GGA 
Gly Gly Met Gly 



CCT CGG GAG 1104 
Pro Arg Glu 
285 



GCT 
10 Ala 



GAG CCT 
Glu Pro 
290 



GGG GAG CTA 
Gly Glu Leu 



GGG ATA GCT CTG 
Gly He Ala Leu 
295 



CGG GGT GGC 1143 
Arg Gly Gly 
300 



GGG 
Gly 



GCT GCA 
Ala Ala 



GAT CCC CCC 
Asp Pro Pro 
305 



TTC TGC CCC CAC 
Phe Cys Pro His 
310 



TAT GAG AAG 1182 
Tyr Glu Lys 



15 GTG 
Val 



AGT GGT 
Ser Gly 
315 



GAC TAT GGG 
Asp Tyr Gly 



CAT CCT GTG TAT 
His Pro Val Tyr 
320 



ATC GTG CAG 3^221 
He Val Gli 
325 



20 



GAT 
Asp 



GGG CCC 
Gly Pro 



CCC CAG AGC 
Pro Gin Ser 
330 



CCT CCA AAC ATC 
Pro Pro Asn He 
335 



TAC TAC? ACA 1260 
Tyr Tyr Thr 



TCG 
Ser 
340 



ATT TCT 
He Ser 



GTG TTG GAG 
Val Leu Glu 
345 



TGG CCC ATA TTG 
Trp Pro He Lei 



iT ACG ATA 1299 
His Thr He 
350 



CAA CTG TTT TTC ATG CGA TCC AAG TGCT TCC CGT GTC ACT 133 8 
25 Gin Leu Phe Phe Met Arg Ser Lys Cys Ser Arg Val Thr 
355 360/ 365 



ACA 
Thr 



TTC TTA 
Phe Leu 



TTT CCT GTG 
Phe Pro Val 
370 



CAA G*rT 
Gln/val 



ATT ACG 
He Thr 
375 



ACA TCG ACT 1377 
Thr Ser Thr 



3 0 TGC CGG ATG ACT TCA TTy AGC TTT ACC ACC CTG AAC CCA 1416 
Cys Arg Met Thr Ser PJ?fe Ser Phe Thr Thr Leu Asn Pro 
380 / 385 390 



35 



TCC 
Ser 



ATG CAG 
Met Gin 



GCC TGC AGA 
Ala/Cys Arg 



GCA CAG 
Ala Gin 



ATG GGG 
Met Gly 
400 



GAA TTC CGA 1455 
Glu Phe Arg 



ATC 
He 
405 



AGA TGG 
Arg Ti 



TGT TTC TGG 
Cys Phe Trp 
410 



GGG GAC AGG ATC CTG GGT ACG 1494 
Gly Asp Arg He Leu Gly Thr 
415 



GCT CTG TTT GTG CTT GTG CTT ATT CTT CTT CTT GGG AGG 1533 



70 



t 



♦ 



Ala Leu Phe Val Leu Val Leu lie Leu Leu Leu Gly Arg 
420 425 430 



CTG AAT ATG CAT CAG ACG ACA CTG CTC CGG CAA 
Leu Asn Met His Gin Thr Thr Leu Leu Arg Gin 
5 435 440 



CGG GCC 1572 
Arg Ala 



AGT GTG GAG GCG GAA GCC GGC CAG CAT GGT CCC CTG TG 1610 
Ser Val Glu Ala Glu Ala Gly Gin His Gly Pro Leu 
445 450 455 



ATAGGATTGA AAGAGCTACT GAGAATAGGG GGCTTCTCAA 
10 GAGGCTGCTG TTATCATGGG AACCAGGCAG ATCAATCATC 
CAGGCAGGAA GTTACTTAGC TTCTCCTTCA CCTTCTTCO 
TTATAGGCTT GTTCCAAGTT GTAGTGTGTG ATCAGAJITCG 
CAGCTCTGTG CTACCTGGCA GTTCCCCTCA TGC^ATTCGA 
ATCGATACCG TCGACCT 1877 
15 (2) INFORMATION FOR SEQ ID NO 

(i) SEQUENCE CHARACTERISJFTCS : 

(A) LENGTH: 455 amino acids 

(B) TYPE: Amino A^id 
(D) TOPOLOGY: Linear 



VGCG 1660 
CCtfGGCAGGT 1710 
ACAGAATTTA 1760 
TGCTGCCTGT 1810 
TATCAAGCTT 1860 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Gly Pro Pro Mis Ser Gly Pro Gly Gly Val 
1 / 5 10 



Arg Val Gly Ala 
15 



Leu Leu Leu ^eu Gly Val Leu Gly Leu Val Ser 
20 25 



Gly Leu Ser Leu 
30 



25 Glu Pro )fal Tyr Trp Asn Ser Ala Asn Lys Arg 

35 40 



Phe Gin Ala Glu 
45 



Gly g/ly Tyr Val Leu Tyr Pro Gin lie Gly Asp 
50 55 



Arg Leu Asp Leu 
60 



30 



Cys Pro Arg Ala Arg Pro Pro Gly Pro His 
65 70 



Ser Ser Pro Asn 
75 



Tyr Glu Phe Tyr Lys Leu Tyr Leu Val Gly Gly 
80 85 



Ala Gin Gly Arg 
90 



Arg Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu 
95 100 



Leu Thr Cys Asp 
105 



71 



Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 
110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 
125 130 135 

5 Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150. 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu 

155 160 /165 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg /Lys Pro 
10 170 175 / 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala/Ala His Ser 
185 190 / 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Aspr Pro Thr Ser Asn 
200 205 / 210 

15 Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pro Pro Pro Ser Met 

215 J120 225 

Pro Ala Val Ala Gly Ala Ala Gly GZy Leu Ala Leu Leu Leu Leu 
230 / 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 
20 245 / 250 255 

Ala Lys Pro Ser Glu Ser Arg His Pro Gly Pro Gly Ser Phe Gly 
260 / 265 270 

Arg Gly Gly Ser Leu ply Leu Gly Gly Gly Gly Gly Met Gly Pro 
275/ 280 285 

25 Arg Glu Ala Glu /ro Gly Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 295 300 

Gly Ala Ala ^Xsp Pro Pro Phe Cys Pro His Tyr Glu Lys Val Ser 
305 310 315 

Gly Asp tfyr Gly His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 
30 / 320 325 330 

Gin §fer Pro Pro Asn lie Tyr Tyr Thr Ser lie Ser Val Leu Glu 
335 340 345 

T/p Pro lie Leu His Thr lie Gin Leu Phe Phe Met Arg Ser Lys 
350 355 360 



35/ Cys Ser Arg Val Thr Thr Phe Leu Phe Pro Val Gin Val lie Thr 

72 



365 



370 



375 



Thr Ser Thr Cys Arg Met Thr Ser Phe Ser Phe Thr Thr Leu Asn 
380 385 390 

Pro Ser Met Gin Ala Cys Arg Ala Gin Met Gly Glu Phe Arg lie 
5 395 400 405 

Arg Trp Cys Phe Trp Gly Asp Arg lie Leu Gly Thr Ala Leu 

410 415 yf20 

Val Leu Val Leu lie Leu Leu Leu Gly Arg Leu Asn Met Bis Gin 
425 430 / 435 

10 Thr Thr Leu Leu Arg Gin Arg Ala Ser Val Glu Ala yGlu Ala Gly 

440 445 / 450 

Gin His Gly Pro Leu 
455 

(2) INFORMATION FOR SEQ ID NO : 3 : 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2380 base pairs/ 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



gntctagaan 
ggaagggctc 
caggaagcag 
cgggctgaag 
25 tttgggggag/ 
ccccccattc 
ggggt/ttgg 
ctcggcgaat 
3atcgggga 
30 / cctcactcct 

TGCTCAGGGC 



TAGTGGATCC 

TGGTGGGJ^CT 

GTCCQCGTGG 

AGCCAGGCAG 

TTGGTGCCCC 

TGGGCCGGGG 

GGCTGGTGTC 

AAGAGGTTCC 

CCGGCTAGAC 

CTCCTAATTA 

CGGCGCTGTG 



CCCCGGGCTG 
GAGCGCTCTG 
GCGCTGGGGG 
CCAAGGCAGC 
GCCCCCCAGG 
GGCGTGCGAG 
TGGGCTCAGC 
AGGCAGAGGG 
CTGCTCTGCC 
TGAGTTCTAC 
AGGCACCCCC 



CAGGAATTCC 
CCGCGGGGGC 
CATCAGCTAC 
CACCCCGGGG 
CCTTGGCGGG 
TCGGGGCCCT 
CTGGAGCCTG 
TGGTTATGTG 
CCCGGGCCCG 
AAGCTGTACC 
TGCCCCAAAC 
73 



GACGGCCCCT 50 
GCGGGCACAG 100 
CGGGGTGGTC 150 
GGTGGGCGAC 200 
GTCATGGGGC 250 
GCTGCTGCTG 3 00 
TCTACTGGAA 3 50 
CTGTACCCTC 400 
GCCTCCTGGC 450 
TGGTAGGGGG 500 
CTCCTTCTCA 550 



CTTGTGATCG CCCAGACCTG GATCTCCGCT TCACCATCAA GTTCCAGGAG 600 
TATAGCCCTA ATCTCTGGGG CCACGAGTTC CGCTCGCACC ACGATTACTA 650 
CATCATTGCC ACATCGGATG GGACCCGGGA GGGCCTGGAG AGCCTGCAGG 7 00 
GAGGTGTGTG CCTAACCAGA GGCATGAAGG TGCTTCTCCG AGTGGGACAA 750 
5 AGTCCCCGAG GAGGGGCTGT CCCCCGAAAA CCTGTGTCTG AAATGCCCAT 800 
GGAAAGAGAC CGAGGGGCAG CCCACAGCCT GGAGCCTGGG AAGGAGAACC 850 
TGCCAGGTGA CCCCACCAGC AATGCAACCT CCCGGGGTGC TGAAGGCCCC 900 
CTGCCCCCTC CCAGCATGCC TGCAGTGGCT GGGGCAGCAG GGGGGCTG0C 950 
GCTGCTCTTG CTGGGCGTGG CAGGGGCTGG GGGTGCCATG TGTTG^CGGA 1000 
10 GACGGCGGGC CAAGCCTTCG GAGAGTCGCC ACCCTGGTCC TGGCTCCTTC 1050 
GGGAGGGGAG GGTCTCTGGG CCTGGGGGGT GGAGGTGGGa/tGGGACCTCG 1100 
GGAGGCTGAG CCTGGGGAGC TAGGGATAGC TCTGCGGGGT GGCGGGGCTG 1150 
CAGATCCCCC CTTCTGCCCC CACTATGAGA AGGT^AGTGG TGACTATGGG 1200 
CATCCTGTGT ATATCGTGCA GGATGGGCCC CJZfCCAGAGCC CTCCAAACAT 12 50 
15 CTACTACAAG GTATGAGGGC TCCTCTCACy TGGCTATCCT GAATCCAGCC 13 00 
CTTCTTGGGG TGCTCCTCCA GTTTAAT^TCC TGGTTTGAGG GACACCTCTA 1350 
ACATCTCGGC CCCCTGTGCC CCCCCZAGCCC CTTCACTCCT CCCGGCTGCT 1400 
GTCCTCGTCT CCACTTTTAG GAfTTCCTTAG GATTCCCACT GCCCCACTTC 1450 
CTGCCCTCCC GTTTGGCCAT VGGGTGCCCCC CTCTGTCTCA GTGTCCCTGG 1500 
20 ATCCTTTTTC CTTGGGGAGG GGCACAGGCT CAGCCTCCTC TCTGACCATG 1550 
ACCCAGGCAT CCTTG^CCCC CTCACCCACC CAGAGCTAGG GGCGGGAACA 1600 
GCCCACCTTT TGG^TGGCAC CGCCTTCTTT CTGCCTCTCA CTGGTTTTCT 1650 
CTTCTCTATC TCTTATTCTT TCCCTCTCTT CCGTCTCTAG GTCTGTTCTT 1700 
CTTCCCTAGC ATCCTCCTCC CCACATCTCC TTTCACCCTC TTGGCTTCTT 17 50 
25 ATCCTGJTGCC TCTCCCATCT CCTGGGTGGG GGCATCAAAG CATTTCTCCC 1800 
CTT#GCTTTC AGCCCCCCTT CTGACCTCTC ATACCAACCA CTCCCCTCAG 1850 



74 



TCTGCCAAAA ATGGGGGCCT TATGGGGAAG GCTCTGACAC 
CTCAGGCCAT GGGCAGCAGG GCTCCATTCT CTGGCCTGGC 
ACATACTTAC TCCAGCCATT TGGGGTGGTT GGGTCATGAC 
AGAAGAAGTG TCCCGTTTTG TCCAGTGGCC AATAGCAAGA 
5 TCGGGACATG TATGGACTTG GTCTGATGCT GAATGGGCCA 
GAAGTGACTT GCTCCAGACA AGAGGTGACC AGGCCCGGAC 
TGGGAAGTAG CAGAAGCAGT GCAGCAGGAA CTGGAAGTGC 
GACAGGAAGT AGCACTTCTG AAACAGGAAG TGGTCTGGCT 
GTGGCTTAGT CTGGGGGATC AGGAGGTGGG AGGTGGATGG 
10 GTGGAGAAGA AGGGCGGGAA GAACTTCCTT TCAGGAGGi 
ACTGACTGTA AGAGGTTAGA GGTGGACCGA 23 80 
(2) INFORMATION FOR SEQ ID NO: 4: 



TCCACCCCAG 1900 
CCAGGCCTCT 1950 
AGCTACCATG 2000 
TATGAACCGG 2050 
CTTGGGACCG 2100 
AGAAATGGCC 2L50 
CTTCATCCAJrf 2200 
GGAACT^CAA 2250 
TTQTTATTCT 2300 
GCTGGAACTT 2350 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acid^ 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION j/SEQ ID NO : 4 : 

Met Gly Pro Pro His Ser jSly Pro Gly Gly Val 
1 5 / 10 



Arg Val Gly Ala 
15 



20 Leu Leu Leu Leu Gly^al Leu Gly Leu Val Ser 

2(/ 25 



Gly Leu Ser Leu 
30 



Glu Pro Val Tyr /Trp Asn Ser Ala Asn Lys Arg 
35 40 



Phe Gin Ala Glu 
45 



25 



Gly Gly Tyr /al Leu Tyr Pro Gin lie Gly Asp 
50 55 



Arg Leu Asp Leu 
60 



Leu Cys IJro Arg Ala Arg Pro Pro Gly Pro His 
65 70 



Ser Ser Pro Asn 
75 



Tyr Gpi Phe Tyr Lys Leu Tyr Leu Val Gly Gly 
80 85 



Ala Gin Gly Arg 
90 



30 Ar^g Cys Glu Ala Pro Pro Ala Pro Asn Leu Leu 

95 100 



Leu Thr Cys Asp 
105 



75 



Arg Pro Asp Leu Asp Leu Arg Phe Thr lie Lys Phe Gin Glu Tyr 

110 115 120 

Ser Pro Asn Leu Trp Gly His Glu Phe Arg Ser His His Asp Tyr 

125 130 135 

5 Tyr lie lie Ala Thr Ser Asp Gly Thr Arg Glu Gly Leu Glu Ser 

140 145 150 

Leu Gin Gly Gly Val Cys Leu Thr Arg Gly Met Lys Val Leu Leuy 

155 160 161 

Arg Val Gly Gin Ser Pro Arg Gly Gly Ala Val Pro Arg Lys/Pro 

10 170 175 / 180 

Val Ser Glu Met Pro Met Glu Arg Asp Arg Gly Ala Ala/ His Ser 

185 190 / 195 

Leu Glu Pro Gly Lys Glu Asn Leu Pro Gly Asp Ppo Thr Ser Asn 

200 205 / 210 

15 Ala Thr Ser Arg Gly Ala Glu Gly Pro Leu Pifo Pro Pro Ser Met 

215 220 / 225 

Pro Ala Val Ala Gly Ala Ala Gly Gly Leu Ala Leu Leu Leu Leu 

230 235 240 

Gly Val Ala Gly Ala Gly Gly Ala Met Cys Trp Arg Arg Arg Arg 

20 245 / 250 255 

Ala Lys Pro Ser Glu Ser Arg Hi/ Pro Gly Pro Gly Ser Phe Gly 

260 / 265 270 

Arg Gly Gly Ser Leu Gly Lej/ Gly Gly Gly Gly Gly Met Gly Pro 

275 / 280 285 

25 Arg Glu Ala Glu Pro Gl/ Glu Leu Gly lie Ala Leu Arg Gly Gly 

290 / 295 300 

Gly Ala Ala Asp Pre/ Pro Phe Cys Pro His Tyr Glu Lys Val Ser 

3/5 310 315 

Gly Asp Tyr GlV His Pro Val Tyr lie Val Gin Asp Gly Pro Pro 

30 / 320 325 330 

Gin Ser Pro ^Pro Asn lie Tyr Tyr Lys Val 
335 340 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
/(A) LENGTH: 454 base pairs 



76 



(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



5 GCCGACGCTG TGAGGCACCC CCTGCCCCAA ACCTCCTTCT 

CGCCCAGACC TGGATCTCCG CTTCACCATC AAGTTCCAGG 

TAATCTCTGG GGCCACGAGT TCCGCTCGCA CCACGATTAC 

CCACATCGGA TGGGACCCGG GAGGCCTGGG AGAGCCTGCA 

TGCCTAACCA GAGGCATGAA GGTGCTTCTC CGAGTNGGAC 

10 GGAGGGGCTG TCCCCCGAAA ACCTGTGTCT GAAATGCCCi 

CCGAGGGGCA GCCCACAGCC TGGGAGCCTG GGGAAGG^GA 

TGACCCCACC AGCAATNCAA CCTTCCGGGG TTGC^TGAAG 

CCCTTTCCCA GCATTGCNTG CANTTGGTTN GQGGCAGCAN 
TGGC 454 

15 (2) INFORMATION FOR SEQ ID NO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino/acids 

(B) TYPE: Amino Acij^ 
(D) TOPOLOGY: Linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Asp Lys Thr His Thr/Cys Pro Pro Cys Pro 

10 

(2) INFORMATION FbR SEQ ID NO : 7 : 

( i ) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH : 60 base pairs 

(B) T>YPE: Nucleic Acid 

(C) /STRANDEDNESS: Single 
(D/ TOPOLOGY: Linear 

(xi) /SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



CACTTGTGAT 50 
AGTATAGCCC 1<00 
TACATCATTG 150 
GGGAAGTGTG 2 00 
TCCCGA 250 
'TGGAAAGAGA 3 00 
ACCTGCCAGG 350 
GGCCCCTTGA 400 
GGGGGNGTTT 450 



30 GGACAAAGTC CCGAGGAGGG GCTGTCCCCC GAAAACCTGT GTCTGAAATG 50 
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Fikle . DKB2 : [MANDL] HS0O6l5PFEMESTl ; 1 

ID HS006163 standard; RNA; EST; 454 *3P. 
XX 

AC H10006; 
XX 

NI g874828 
XX 

DT 02-JUL-1995 (Rel . 44, Created) 

DT 02-JUL-1995 (Rel. 44, Last updated, Version 1) 

XX 

DE ym02ell.rl Homo sapiens cDNA clone 46610 5' similar to contains Alu 

DE repetitive element;. 

XX 

KW EST . 
XX 

OS Homo sapiens (human) 

OC Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 

0C Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

RN [1] 

RP 1-454 

RA Hillier L., Clark N., Dubuque T., Elliston K., Hawkins M. , 

RA Holman M. , Hultman M. , Kucaba T., Le M., Lennon G., Marra M. , 

RA Parsons J., Rifkin L., Rohlfing T., Soares M. , Tan F., 

RA Trevaskis E., Waterston R., Williamson A., Wohldmann P., Wilson R.; 

RT "The WashU-Merck EST Project"; 

RL Unpubl ished . 

XX 

CC GDB: G00-419-151 Contact: Wilson RK WashU-Merck EST Project 

CC Washington University School of Medicine 4444 Forest Park Parkway, 

CC Box 8501, St. Louis, MO 63108 Tel: 314 286 1800 Fax: 314 286 1810 

CC Email: est@watson.wustl.edu High quality sequence stops: 313 

CC Source: IMAGE Consortium, LLNL This clone is available royalty-free 

CC through LLNL ; contact the IMAGE Consortium (info@image.llnl.gov) 

CC for further information. NCBI gi: 874828 

XX 

p H Key Location/Qualifiers 
FH 

FT source l. .454 

?T /organism="Homo sapiens" 

?T /clone="46610'» 

?T /note= "human" 

CX 

3Q Sequence 454 BP; 93 A; 137 C; 133 G; 84 T; 7 other; 
HS006163 Length: 454 August 19, 1997 11:01 Type: N Check: 5608 .. 
1 GCCGACGCTG TGAGGCACCC CCTGCCCCAA ACCTCCTTCT CACTTGTGAT 
51 CGCCCAGACC TGGATCTCCG CTTCACCATC AAGTTCCAGG AGTATAGCCC 
101 TAATCTCTGG GGCCACGAGT TCCGCTCGCA CCACGATTAC TACATCATTG 
151 CCACATCGGA TGGGACCCGG GAGGCCTGGG AGAGCCTGCA GGGAAGTGTG 
201 TGCCTAACCA GAGGCATGAA GGTGCTTCTC CGAGTNGGAC AAAGTCCCGA 



251 GGAGGGGCTG TCCctfcAAA ACCTGTGTCT GAAATGCCC^^GGAAAGAGA 

301 CCGAGGGGCA GCCCACAGCC TGGGAGCCTG GGGAAGGAGA ACCTGCCAGG 

351 TGACCCCACC AGCAATNCAA CCTTCCGGGG TTGCTTGAAG GGCCCCTTGA 

401 CCCTTTCCCA GCATTGCNTG CANTTGGTTN GGGGCAGCAN GGGGGNGTTT 

451 TGGC 



rurR? • TMANDLl U57001 . EMHUM2 ; 1 

1 d hsS?06i standard; RNA; HUM; 1708 BP. 

X 

,C U57001; 
[X 

II gl469781 

vr -n-JUL-1996 (Rel. 48, Created) . 

£ 3^-1996 (Rel. 48, Last updated, Versxon 1) 



XX 



_ Human ligand for eph-related receptor tyrosine kinases (EPUS8) 

DE mRNA, complete cds. 

XX 
KW 
XX 

OS Homo sapiens (human) Mllearvote s- Metazoa; Chordata; 

£ SSSSf jSjSSTSSirW-l; Ho.inidae; Homo. 

XX 

RN [1] 

RP 1-1708 

n "till-" ° Ugand for the eph-related receptor tyrosine kinases" , 

RL Unpubl i shed . 

XX 

RN [2] 

RP 1-1708 

ra Cerretti D.P. ; 

57 £ hm ^ r pH f29-APR-1996) to the EMBL/GenBank/DDBJ databases. 

S SSSSi/SxJSEtif'^lecularBlolw. l-unex Corp . , 51 

RL SXSersity St., Seattle, WA 98101, USA 
XX 

DR SPTREMBL; Q15768; Q15768 . 



XX 
FH 
FH 
FT 
FT 
FT 



Key Location/Qualifiers 

source 1. .1708 

/organism="Homo sapiens" TTTnon , „ 
/clone_lib="Clontech number HL300ia 



FT sis peptide 398. .466 

fi sig_y v /gene="EPLG8» 

CDS 398. .1420 

IL /gene="EPLG8» 

/note="LERK-8" 
FT /codon_start=l 



/product="ligand for eph-related receptor tyrosine kinase : 
PT /db_xref="PID:gl469782« 
S ^nflatlonlS^ 

pT GGYVLYPQ IGDRLDLLCPRARPPGPHS S PNYEFYKLYLVGGAQGRRCEAPPAPNLLL 

DRPDLDLRFTIKFQEYSPNLWGHEFRSHHDYYIIATSDGTREGLESLOGGVCLTRGM 



\ FT* 



FT 



FT 



LLRV^PRGGAVPRKPVSEMPMERDRGAAH^PGKENLPGDPTSNATSRGAEGPLPl 

PSMPAVAGAAGGLALLLLGVAGAGGAMCWRRRRAKPSESRHPGPGSFGRGGSLGLGGGC 
GMGPREAEPGELGIALRGGGAADPPFCPHYEKVSGDYGHPVYIVQDGPPQSPPNIYYIC 



FT 
FT 
FT 
FT 
XX 



misc_feature 



1070. .1150 
/gene="EPLG8" 

/note=" encodes transmembrane domain" 



SQ Sequence 1708 BP; 244 A; 563 C; 572 G; 329 T; 0 other; 
U57001 Length: 1708 August 20, 1997 12:03 Type: N Check: 4485 .. 
1 AATTCCGGCC CTTAGCCCGC TGCCCTCAAT CCCAGCGAGG CTGGGGCTCC 
51 GGCTCGGCGC CCCCTTCCTC GCTCCCTGGT CCGGCGCCCC ATGCCGCCCC 
101 CGCCCGGTCC CCGGCTCCCC CAGTCCCCCA CTTAGGCGGG CTCACAGATC 
151 CCGGGGTGCT GGCGCGTGGG CCGGGGGCGC GTAGGGCGCC TGCAGACGGC 
201 CCCTGGAAGG GCTCTGGTGG GGCTGAGCGC TCTGCCGCGG GGGCGCGGGC 
251 ACAGCAGGAA GCAGGTCCGC GTGGGCGCTG GGGGCATCAG CTACCGGGGT 

301 GGTCCGGGCT GAAGAGCCAG GCAGCCAAGG CAGCCACCCC GGGGGGTGGG 

351 CGACTTTGGG GGAGTTGGTG CCCCGCCCCC CAGGCCTTGG CGGGGTCATG 

401 GGGCCCCCCC ATTCTGGGCC GGGGGGCGTG CGAGTCGGGG CCCTGCTGCT 

451 GCTGGGGGTT TTGGGGCTGG TGTCTGGGCT CAGCCTGGAG CCTGTCTACT 

501 GGAACTCGGC GAATAAGAGG TTCCAGGCAG AGGGTGGTTA TGTGCTGTAC 

551 CCTCAGATCG GGGACCGGCT AGACCTGCTC TGCCCCCGGG CCCGGCCTCC 

601 TGGCCCTCAC TCCTCTCCTA ATTATGAGTT CTACAAGCTG TACCTGGTAG 

651 GGGGTGCTCA GGGCCGGCGC TGTGAGGCAC CCCCTGCCCC AAACCTCCTT 

701 CTCACTTGTG ATCGCCCAGA CCTGGATCTC CGCTTCACCA TCAAGTTCCA 

751 GGAGTATAGC CCTAATCTCT GGGGCCACGA GTTCCGCTCG CACCACGATT 

801 ACTACATCAT TGCCACATCG GATGGGACCC GGGAGGGCCT GGAGAGCCTG 

851 CAGGGAGGTG TGTGCCTAAC CAGAGGCATG AAGGTGCTTC TCCGAGTGGG 

901 ACAAAGTCCC CGAGGAGGGG CTGTCCCCCG AAAACCTGTG TCTGAAATGC 

951 CCATGGAAAG AGACCGAGGG GCAGCCCACA GCCTGGAGCC TGGGAAGGAG 

1001 AACCTGCCAG GTGACCCCAC CAGCAATGCA ACCTCCCGGG GTGCTGAAGG 

1051 CCCCCTGCCC CCTCCCAGCA TGCCTGCAGT GGCTGGGGCA GCAGGGGGGC 



,-*i01 TGGCGCTGCT CTlf^GGGC GTGGCAGGGG CTGGGGGT^|cATGTGTTGG 

1151 CGGAGACGGC GGGCCAAGCC TTCGGAGAGT CGCCACCCTG GTCCTGGCTC 

1201 CTTCGGGAGG GGAGGGTCTC TGGGCCTGGG GGGTGGAGGT GGGATGGGAC 

1251 CTCGGGAGGC TGAGCCTGGG GAGCTAGGGA TAGCTCTGCG GGGTGGCGGG 

1301 GCTGCAGATC CCCCCTTCTG CCCCCACTAT GAGAAGGTGA GTGGTGACTA 

1351 TGGGCATCCT GTGTATATCG TGCAGGATGG GCCCCCCCAG AGCCCTCCAA 

1401 ACATCTACTA CAAGGTATGA GGGCTCCTCT CACGTGGCTA TCCTGAATCC 

1451 AGCCCTTCTT GGGGTGCTCC TCCAGTTTAA TTCCTGGTTT GAGGGACACC 

1501 TCTAACATCT CGGCCCCCTG TGCCCCCCCA GCCCCTTCAC TCCTCCCGGC 

1551 TGCTGTCCTC GTCTCCACTT TTAGGATTCC TTAGGATTCC CACTGCCCCA 

1601 CTTCCTGCCC TCCCGTTTGG CCATGGGTGC CCCCCTCTGT CTCAGTGTCC 

1651 CTGGATCCTT TTTCCTTGGG GAGGGGCACA GGCTCAGCCT CCTCTCTGAC 

1701 CATGCCGG 



